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Abstract 

The aim of the paper is to derive essential elements of quantum me- 
chanics from a parametric structure extending that of traditional math- 
ematical statistics. The main extensions, which also can be motivated 
from an applied statistics point of view, relate to symmetry, the choice 
between complementary experiments and hence complementary paramet- 
ric models, and use of the fact that there for simple systems always is a 
limited experimental basis that is common to all potential experiments. 
Concepts related to transformation groups together with the statistical 
concept of sufficiency are used in the construction of the quantumme- 



chanical Hilbert space. The Born formula is motivated through recent 
analysis by Deutsch and Gill, and is shown to imply the formulae of el- 
ementary quantum probability/ quantum inference theory in the simple 
case. Planck's constant, and the Schrodinger equation are also derived 
from this conceptual framework. The theory is illustrated by one and 
' by two spin 1/2 particles; in particular, a statistical discussion of Bell's 

inequality is given. Several paradoxes and related themes of conventional 
quantum mechanics are briefly discussed in the setting introduced here. 



1 Introduction. 

Nobody doubts the correctness of quantum mechanics. But the completeness 
of the theory has been debated since Einstein, Podolsky and Rosen Qj first 
raised the issue explicitly in 1935. Now a well known general theorem by Godel 
from 1931 - see |2] - says that every rich enough theory may be regarded as 
incomplete in a certain sense. The difficulty with quantum theory is that it is 
nearly impossible to discuss simply its bordering area, say towards macroscopic 
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theories in general or towards relativity specifically, since the foundation of the 
theory is always presented in purely formal terms. 

Some intuitive notions have of course been developed around quantum me- 
chanics during the years, but it is very difficult to have any immediate un- 
derstanding of a theory starting by stating that an observable - whatever that 
should mean in intuitive terms - is defined as a selfadjoint operator on a complex 
Hilbert space. This is of course an important element of quantum theory, but 
taken as an axiom, it is very formal. 

The other foundation stone of modern physics, special relativity, has a beau- 
tiful basis of simple assumptions. Note that 'simple' here does not mainly mean 
simple in formal mathematical terms, but rather in everyday language: physi- 
cal laws are the same for all observers, and the speed of light is the same. So 
the question is: Is there some possibility of finding a similar simple basis for 
quantum mechanics also? 

The main purpose of this paper is to suggest a foundation of quantum me- 
chanics based on relatively simple concepts like: choice of experiment, statistical 
parameter, symmetry and model reduction. We claim that this approach may 
lead to a conceptual starting point which is more intuitive than the usual one. 
Parts of our goal will also be to develop a theory which brings the statistical 
tradition and the traditions developed in physics closer, ultimately to the extent 
that the two traditions may learn from each other. It should be unnecessary to 
point out, of course, that with an aim as ambitious as that, there will be open 
questions, both technical ones and questions related to the underlying philos- 
ophy and to the interpretation of concepts. The hope is, however, that the 
process started here will continue, and that this process in the end will turn out 
to be of some benefit to both sciences. 

It is well known that there exist a large number of interpretations of quan- 
tum theory; an incomplete list is given by the references PJ ^ [SJ |21 . The 
present article implies a particular statistical interpretation closely related to 
the epistemic view of states to Bohr's original minimalistic view and also 
to the neo-Copenhagen interpretation Our main focus, however, will be on 
trying to derive the theory using simpler, less formal concepts. This will be 
done through a thorough discussion of the structure of the parameter spaces of 
the relevant experiments. 

There are a few related papers in the recent literature. A. Bohr and Ulfbeck 
[in] discuss a foundation of quantum mechanics which is based upon irreducible 
representation of groups, and thus uses symmetry in a way which is similar to 
ours. Caves et al JT] proposes a Bayesian approach to quantum theory based 
upon Gleason's powerful Hilbert space theorem. Here we will avoid taking an 
abstract Hilbert space as a point of departure, but we will arrive at it from a 
rather concrete setting. Finally, Hardy |12j derives quantum theory and prob- 
ability theory in an elegant way from a few reasonable axioms, but using the 
concept of measurement in a simpler way than we do here. Our motivation is 
similar; as Hardy puts it: 'Could a 19th century theorist have developed quan- 
tum theory without access to the empirical data that later became available?' 
Our answer is a conditional yes: Provided that the relevant statistical theory 



2 



had been developed earlier. It must be emphasized that we go further in looking 
at complementary models and in reducing models under symmetry than what 
is common in the present statistical literature. One hope is that this later can 
be justified more explicitly from a prediction point of view. 

On the statistical side McCuUagh J3] is an important connection. That 
paper takes certain applied problems related to the definition of a statistical 
model as a point of departure, but ends up with a rather advanced discussion 
using category theory. In the present paper we will specialize to the use of group 
theory, partly to keep the discussion relatively simple, and partly because the 
quantum world is full of symmetries. It is of interest, though, that category 
theory concepts recently have found a strong position in Isham's ^1 
discussion of quantum theory in cosmology. 

What is then our conclusions concerning the completeness of quantum me- 
chanics? According to our view, quantum theory itself can be interpreted as a 
statistical theory, and is as such reasonably complete. The underlying paramet- 
ric model may or may not be complete in concrete cases, and there may also be 
different models that lead to the same prediction, essentially in the same way as 
different choices of gauge in an electromagnetic field give the same observation. 
The corresponding model parameter may in some sense be related to a hidden 
variable of the kind first rejected by von Neumann ^7], but later defended by 
Bell |18| and others. However, in our view a hidden parameter is a simpler, 
much more flexible and also more adequate concept. Below we will introduce 
the concept of metaparameter, a pure modelling concept which may comprise 
several potential experiments. A metaparameter will not in general take a value, 
in agreement with the Kochen-Specker theorem, but also in agreement with the 
fact that there is a limit to how many parameters you can make inference on 
from limited data in an ordinary statistical experiment. 

A basic attitude behind the present paper is the following: Physics is an 
empirical science, and seeking its foundation one should look at methods and 
model considerations that have proved useful in other empirical sciences. In my 
opinion, too much of this field is dominated by formal mathematics. Mathemat- 
ics is of course important and useful, but the very foundation of physics should 
be simple, and one should then refer to a concept of simplicity which is based 
on science, not necessarily on notions belonging to the mathematical tradition. 

The plan of the paper is as follows: The background and basic concepts are 
introduced in Sections 2-7. In Sections 8-10 the one- and two-particle situations 
are discussed, including a statistical treatment of Bell's inequality. A survey of 
group representation theory and related concepts are given in Sections 11-12. 
Then in Sections 13-15 a construction of the basic Hilbert space for quantum- 
mechanics is made using statistical concepts and symmetry. This is probably 
the most important contribution of the present paper. Various arguments for 
the Born formula are briefiy discussed in Section 16, and in Section 17 it is in- 
dicated how essential elements of quantum mechanics and of quantum statistcs 
may be deduced from this. Section 19 discusses the Lorentz transformation and 
Planck's constant, while an argument for the Schrodinger equation is given in 
Section 20. 
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2 Statistics and quantum theory. 



As is well known, statistical methodology has had applications in most areas 
of empirical science, including experimental physics. Statistical inference is 
based upon a relatively simple paradigm: There is an unknown part of reality 
that we want to learn something about; this is described by a parameter 9. 
Learning is done through making observations y, and in general the act of 
making such observations is called an experiment. A model for an experiment 
is made by postulating probability measures on a sample space S, that is, a 
space connected to potential observations. The model is then given by a class 
of probability measures on S; say {P^(-)}, that is, the measures are indexed 
by the unknown assumed part of reality 9. The observations y are stochastic 
variables, i.e., functions on the sample space S. 

Statistical inference is the art of deducing information on 9 from the observa- 
tions y. The Bayesian school of statistical inference theory assumes in addition 
to the model a prior distribution on the parameter space. 

The Bayesian way of thinking also has a strong position in current quantum 
information theory, see Fuchs |S] and references there. If we interprete the sta- 
tistical parameters as something like quantum theoretical state variables, which 
will be the point of departure for much of the present paper, the distinction 
between quantum information Bayesianism and statistical Bayesianism will be 
relatively small. In statistical theory there exist viewpoints labeled 'objective 
Bayesianism' 19 , which may sound like a contradiction, but which can in fact 
be made to make sense. One version of this, where priors are induced by sym- 
metry groups, is in fact underlying much of the present paper. 

An important distinction between quantum information and statistics is that 
the main application of the Bayesian assumption in statistics is in the inference 
from observations to parameters using the measurement model, that is, the mea- 
sure 

P'^(.) 

on the potential observations. This statistical measurement model is currently 
used routinely and with success in a large number of sciences, medicin, biology, 
social sciences and so on. There is no reason why such a statistical point of view 
shouldn't be relevant to physics also. Also in physics any measurement appa- 
ratus implies uncertainty. In fact we intend to show in this paper that it may 
be very fruitful for the understanding of quantum theory to regard state vari- 
ables as statistical parameter determining the distribution of the observations. 
The tradition in quantum physics has been to concentrate on other aspects 
of the measurement process, namely those envisaged by von Neumanns formal 
analysis. Developing further the approach of the present paper, one can show 
that these latter aspects also can be made consistent with having a distinction 
between state variables and observations, and a measurement model relating 
these. 

We will come back to the measurement model and the observations later in 
this paper; first we will concentrate on the state variables or parameters. These 
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parameters will be important throughout the paper. 

It will be a point of departure that the parameter as such only makes sense 
within the experiment that it is connected to: A certain question is raised by 
the experiment, and the value of the parameter is the ideal answer to that 
question. A new element will later turn up, though, as a consequence of the 
theory: Given the value of the parameter, then the symmetry assumptions, 
the assumption of model reduction and the fact that the same unit always 
is involved, seems to imply the well known Born formula, amounting to the 
following: If another experiment is done on this unit, a probability distribution 
of the relevant parameter for this latter experiment can also be found. In fact, 
it is here that the quantum formalism supplies a completely new elements to 
ordinary statistical inference, valid under the stringent symmetry conditions 
that one finds in the simple microscopic world. The statistical argument behind 
Bern's formula will only be indicated in this paper by summarizing results by 
Deutsch, Gill and others. 

The distinction between the parameter space defining the state of a system 
and the sample space (space of observations) with its estimators is essential for 
the present paper. The distinction is not usually made in physics, but it is crucial 
in statistical inference. We will keep the distinction even in cases with perfect 
observation, where the value of each observation almost equals some function of 
the parameter. This is consistent with current statistical theory, and it may be 
a way to understand better certain paradoxes of quantum mechanics. We will 
also extend the parameter space (state space) to contain state variables that are 
known or assumed known, contrary to what is common in statistics. 

An essential point of the statistical paradigm is that, before the experiment 
is done, the parameter 9 is unknown; afterwards it is as a rule fairly accurately 
determined. In this way the focus is shifted from what the value of the parameter 
'is' to the knowledge we have about the parameter. In a physical context this 
can easily be made consistent with the point of view expressed by Niels Bohr 
PU] : 'It is wrong to think that the task of physics is to find out how nature is. 
Physics concerns what we can say about nature.' 

In several cases the statistical model may be too rich for the parameters to be 
identified, but even so the parameters may be of interest. On certain occations 
there may be a choice with respect to which parameter to estimate. For instance, 
assume that we want measure some quantity with an apparatus which is so 
fragile that it is destroyed after a single measurement. We may model the 
measured values to have an expectation /i and a standard deviation a, perhaps 
even a normal distribution with these parameters. A single measurement gives 
an estimate of fi. The standard deviation may be thought to be possible to 
estimate by dismantling the apparatus, again destroying it. 

Important ingrediences of the paper seen from a statistical point of view are: 
Model reduction, symmetry, complementary parameters. 

The last of these concepts is particularly important, and extends the common 
statistical way of thinking. In a model of a particle we can imagine that it has 
a theoretical, definite position 9^ = ^ and a theoretical momentum 6^ — tt, but 
there is a limit to how accurate these parameters can be determined. From our 
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point of view, this is conceptually not much more difficult than the following: 

A given patient has (expected) recovery time 9^ if treatment 1 is used and 9'^ 
if treatment 2 is used. The term expected here must be interpreted in some loose 
sense, not necessarily with respect to a well-defined probability model. Like all 
parameters, 9^ and 9^ can be estimated from experiments, but it is impossible 
to estimate both parameters on the same patient at the same time. In statistics 
this and similar problems are solved by investigating several units, here patients, 
assuming the same parameters for all units. Such an assumption is relevant to 
ordinary statistical investigations, where the purpose is to say something of 
importance for a large population of patients, and hence to future patients. In 
quantum physics the parameter must be connected to a single particle or to 
a small number of particles, and then the analogy with a model for a single 
patient becomes of interest. As will turn out, the only possibility then of being 
able to infer something about such a parameter, is to make stringent symmetry 
assumptions. 

3 Statistical models and groups. 

Define the parameter 9 of an experiment as above, and let a symmetry group 
G be defined on the parameter space. This group G will be kept fixed, being 
thought of as a part of the specification of the model. The basic requirement 
for choosing G is that the parameter space should be closed under the actions g 
of the group: 9 9g, where it is convenient to place the symbol for the group 
element acting on 9 on the right. (This will lead naturally to the right invariant 
measure as the non-informative prior on the parameter space, a solution that 
was argued for in |23 to be the best one from several points of view.) 

Throughout this paper, we will regard groups as transformation groups act- 
ing on concrete spaces, primarily the parameter space, but also the space of 
observations. In the mathematical literature this is called group actions, which 
can be regarded as a group of automorphisms of a given space. 

Sometimes in statistics a symmetry group G on the sample space is defined 
first, and then G is introduced via the statistical model by defining P^^ by 

P'^^A) = P'^{Ag-^) for sets A. (1) 

Then the connection from G to G is a homomorphism: 

91,92 ^ gi, §2 implies gig2 ^ gih- (2) 

The concept of homomorphism will be fundamental to this paper. It means 
that we have very similar group actions: The identity element, inverses and 
subgroups are mapped as they should from G to G, i.e., the essential structure 
is inherited, li g —^ e implies g = e (identity elements), the homomorphism 
will be an isomorphism: The structures of the two groups are then essentially 
identical. If in addition a one-to-one correspondence can be established between 
the spaces upon which the groups act, everything will be equivalent. 



6 



4 Metaparameters. 



The model description above demands in principle that the parameter should 
be estimable from the available data. For models involving only one or a few 
units, this is typically not realistic at all, as already noted. 

A simple example is the location and scale parameters /i, cr in a case where 
only measurement on a single unit is possible. As another example, assume 
that two questions are to be asked to an individual, and we know that the 
answer will depend on the order in which the questions are posed. Let (6*^,6'^) 
be the expected answer when the questions are posed in one order, and {6^ , 9^) 
when the questions are posed in the other order. Then = {6^,9^,9^,0'^) 
cannot be estimated from one individual. Many more realistic, moderately 
complicated, examples exist, like the effect of treatments on a patient where 
only one treatment can be given, or behaviourial parameters of a rat taken 
together with parameters of the brain structure which can only be measured if 
the rat is killed. 

In all these cases the situation can be amended through investigating several 
individuals, but this assumes that the parameters are identical for the different 
individuals, a simplification in many cases. Note that the ordinary statistical 
paradigm in principle assumes an infinite population of units with the same 
parameters. This will not be assumed in the present paper. 

When considering these cases where (j) cannot be estimated from any exper- 
iment on the given units, we may call $ = {4>} a metaparameter space rather 
than a parameter space. We nevertheless insist that modelling through meta- 
parameters can be enlightening, in the cases mentioned above as in other cases. 
In particular this may be useful if one models cases where one has the choice be- 
tween several measurements, as one usually will have in quantum mechanics. As 
will be discussed below, by choosing a particular experiment in a given setting, 
what one can hope for, is to be able to estimate a part of the metaparameter. 

Sometimes it will be convenient to use the term 'parameter' both for meta- 
parameters and ordinary parameters, and then use the specific term 'estimable 
parameter' for the latter. The term estimability, as used here, is the same as in 
statistics 22 : 

Definition 1. 

A parameter 9 is unhiasedly estimable if there exists a model P^(-) for that 
experiment and a random variable y of the experiment such that 



In that case we say that the parameter 9 can be estimated unhiasedly by the 
observator y. More generally, a parameter is estimable if there is a one-to- 
one function of 9 which is unbiasedly estimable. 

The last generalization is made to ensure that a one-to-one transformation 
of a parameter S,(9) is estimable whenever 9 is estimable. 
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The metaparameter space 4> can in general have almost any structure; we will 
assume here that it is a locally compact topological space. We will also assume 
that there is a transformation group G acting on $, and that G satisfies certain 
weak technical requirements (see [2]) so that $ can be given a right invariant 
measure v, satisfying v((d4>)g) = v{d4)). The invariant measure is unique for 
transitive groups, i.e., groups having the property that for each 4>i,4>2 there 
exists a g such that (pig = (j)2- In general the invariant measure is unique on 
orbits, i.e. sets of the form {(j)g : g G G}. It must be supplemented by a measure 
on the orbit indices in order to give a measure on the whole space $. 

5 Choice of question. 

We will propose here a general procedure in physics quite similar to that used 
in good applied statistics: After the situation has been clarified in terms of a 
parametric structure, the first issue is to choose what we are interested in, and 
then which experiment to perform. This will then first lead to a focus parameter 

There are usually many questions that can be investigated in a given set- 
ting. Typically the different such questions are addressed performing different 
experiments on the specific part of reality in question. 

Let A be the set of such questions. This gives for each a & A a focus 
parameter 9°- = 0°'{(j)) (possibly a vector). Depending upon the circumstances, 
this may still be a metaparameter. To achieve a crisp probability model, and 
through that hopefully an estimable parameter, some model reduction may have 
to be performed; see Section 7. 

When a group G is defined on the original (meta)parameter space an 
important property of the focus parameter 0" is if it is a natural function (</>), 
that is, satisfying: 

If 6i(0i) = 6{<j)2) then 6'(0ig) = e{(l)2g) for all g e G. 

The most important argument for this restriction is that it leads to a uniquely 
defined group G on the image space 9 of 9{(j)): 

{Ogm = 0{cj,g). (3) 

Some additional arguments for the requirement of naturalness are given in 
|21l 1231 . Among other things certain paradoxical conclusions related to Bayes 
estimation are avoided if focus parameters are required to be natural functions. 
Thus what we do here, is to demand nature to avoid certain paradoxes. 

Trivially, the full parameter 6 = cf) is natural. Also, the vector parame- 
ter [Oi,. . . ,9k) is natural if each 9i is natural. These two facts indicate that 
in addition to the requirement that the experimental parameter 0" should be 
natural, one must typically also require that it is not too big. If necessary for 
estimability, model reduction must be done, as discussed below. This depends 
upon the context of the experiment. 
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As a simple illustration of a group connected to a parameter space or the 
metaparameter space, look at the (meta)parameter cf) = (/i, a) with the trans- 
lation/scale group a) ^ {a + bfi, ha) where 6 > 0. The following one- 
dimensional parameters are natural: /i, a, fi^, fi + a, fi + 3cr, and if a focus 
parameter is asked for, all these give valid candidates. 

On the other hand, the following parameters are not natural, and would 
according to McCuUagh ^21 lead to absurd focus parameters under this group: 
IJ, + cr^, cre^, tg(//)/sin(cr). 

A further example is given by the coefficient of variation cr//i. This is not 
natural since the location part of the transformation does not make sense here. 
But it will be natural if the group is reduced to the pure scale group (/j,, a) 

ba), b> 0. Going back to the 'absurd' examples above, we also see that the 
first two of them will be natural if the group is reduced to the pure translation 
group (/i, a) ^ {a + (i, a). This points at an important general principle: 

If a focus parameter 9°'{(j)) is not natural with respect to the basic group G, 
then take a sub-group G"' so that it becomes natural with respect to this subgroup. 

One can easily show in general [23 that there exists a maximal subgroup 

having the property that 0°(<^) is natural with respect to this group. Then 
this induces a group G" on 0° = 6'°($), and there is a simple homomorphism 
from G° to G" determined as in 

A simpler situation where the theory of this paper also applies in principle, 
is when the parameter set {9'^; a G A} is given at the outset, together with 
groups G". Then one may just define (j) = {9°-- a £ A), that is, the vector of all 
0" and define G by = (6'"5"; a e A). 

6 Context. 

Any experiment is done in a context, that is, for some given experimental units, 
some preconditioning done on these units, some assumptions explicitly made 
and verified before the experiment and some physical environment chosen for 
the whole experiment. 

In connection to quantummechanical experiments, the concept of context 
has been discussed at some length by Khrennikov [^SJ 121] and others. Much of 
the qualitative part of this discussion can be transferred to any experimental 
situation and any set of experiments, but by using a special argumentation, 
Khrennikov also discusses the two-slit experiment from a contextual viewpoint. 

A context may include the knowledge of parameters estimated with certainty 
in earlier experiments. 

As discussed later, parts of the context may be formed by conditioning upon 
random variables with distribution independent of any parameters, in particular 
upon the experiment chosen. 

In general, the context may consist of a complex of conditions upon which 
all probabilities for potential experiments depend. 

As already discussed in the Section 2 and 4, for a given context, certain 
pairs of experiments are incompatible: Only one of these experiments can be 
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performed in a given context. Niels Bohr used the concept of complementar- 
ity in a sense closely related to this. Many physicists have followed Bohr's use 
of the word complementarity even though this is somewhat problematic: The 
same word is used in a different meaning in psychology and in color theory. 
Some theoretical physicists, among them L. Accardi, argue that the word com- 
plementarity should only be used for potential experiments that are maximally 
incompatible in some precisely defined sense: For two discrete parameters, in 
a state determined by fixing one of them, the posterior probability distribution 
of the other should be uniform over its values. By taking limits, a similar no- 
tion can be defined for continuous parameters, even parameters like expected 
position or momentum, taking values on the whole line. 

7 Model reduction. 

Nearly every useful model of reality is a simplification. Even more important: A 
simplification may be forced upon us by the requirement that it shall be possible 
to perform a meaningful experiment in a given context. 

Definition 2. 

We say that the parameter 9 has been reduced to the parameter X if we 
first have A = r]{0) and possibly also reduce the range © of 6, and then let the 
statistical model be reduced to only depend upon A. 

By suitably defining A, this includes many cases, like equating parameter 
values for different individuals, letting A be a discretized version of or letting 
A be by a selected set of orbits of 6 under the group G. 

We do not have any ambition here of formulating a complete theory of model 
reduction. 

In this paper every model reduction corresponding to a focus parameter 0" 
is done by reducing the numbers of orbits G"^ as acting on 0°. In the extreme 
case the parameter space is reduced to a single orbit. There is in fact a strong 
argument for this policy: Within orbits, an optimal parameter estimator exists 
in the socalled Pitman estimator, which is the Bayes estimator with the invariant 
measure as prior. Hence the only room for useful model reduction, that is, model 
reduction leading to better predictions, is in the orbit index. 

A model reduction from 0^ to A° via the orbit index, will always be natural, 
which is easy to see. Also, the same group can be used on the image of A" as 
was used on the image of 9"'. After having reduced the model parameter from 
9"' to A", we now assume that there is a measurement model 

P'°(-) 

for the potential observations. 

Which model reduction that is chosen, will also depend on the experimen- 
tal basis, i.e., on parts of the context. We will discuss this later in a group 
representation setting. 
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A relevant point here is that the same kind of model reduction has turned 
out to be of interest in a completely different setting: One main purpose of 
model reduction in statistics is to improve the predictive power of a model with 
too many parameters. In particular, the strategy of reducing a model through 
the orbit index of a suitable group, has proved to be very useful in the field of 
chemometry. Specifically, such a model reduction in a random regression model 
with the rotation group on the parameter space of regression coefficients and 
a;-covariance matrix was discussed in |27]. This turned out to give relations to 
certain chemometric regression methods which have proved to be useful, and 
which have originally been motivated in more intuitive ways. 

8 One particle model. 

The statistical modelling concepts introduced so far, are rather straightforward, 
but they do have implications, as the following example shows. Consider a 
particle with a theoretical spin (j) given as some vector with norm 7, and let the 
group G be the group of rotations of this vector. Assume a basic contextual 
setting such that the most we can hope to be able to measure is the angular 
momentum component 0°'{4)) — 7C0s(q;) in some direction given by a unit vector 
a, where a is the angle between (j) and a. Here a can be chosen freely. Given a, 
and given the measurement in the direction a, the rest of the metaparameter 
will be totally unavailable. 

The function 9"'{-) is seen to be non-natural for fixed a: Two vectors with 
the same component along a will in general have different such components after 
a rotation. The maximal group G° with respect to which 9°'{-) is natural, is the 
group of rotations of the unit vector around the axis a possibly together with a 
180° rotation around any axis perpendicular to a. 

The group induced by on the image space for 0° is called G°. This group 
has several orbits: For each k E (0,7], one orbit is given by the two values 
0°' = K and 6°" = —k. In addition there is an orbit for k — 0. 

We want in general that any reduction of the parameter space should be to 
an orbit or to a set of orbits. This gives the maximal possible reduction A" of 
0" to a single orbit {— k, k}. The value of k is in a certain sense arbitrary; we 
take K = 1 to conform to the usual notation for spin 1/2 particles. Anticipating 
concepts that will be introduced later, this indicates that the corresponding 
state space Ti must be two-dimensional and invariant with respect to G"^. Since 
this should hold for all G", it follows that H should be invariant with respect to 
the whole rotation group; thus we are lead to the usual Hilbert space for spin 
1/2 particles, a two-dimensional irreducible invariant space under the rotation 
group. 

The corresponding Hilbert spaces for particles with higher spin quantum 
numbers are taken in the usual way as irreducible representation spaces of the 
rotation group. Note that these will require several orbits of the group G°. As 
is well known, these representations can be indexed by the norm of the reduced 
spin vector. 
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9 The EPR situation and Bell's inequality. 



Consider next the situation of Einstein, Podolsky and Rosen ^ as modified by 
Bohm, wliere two particles previously have been together in a spin state, so 
that they - in our notation - later have opposite spin vectors (j) and —cf). In 
ordinary quantum mechanics this is described as an entangled state, that is, 
a state for two systems which is not a direct product of the component vec- 
tors. According to our programme, we will stick to the parametric description, 
however. 

As pointed out by Bell pS) and others, correlation between distant measure- 
ments may in principle be attributed to common history, but apparently not so 
in this case, where Bell's inequality may be violated. 

Assume that spin components A"^ and are measured in the directions given 
by the unit vectors a and b on the two particles at distant sites A and i3, where 
the measured values A"^ and n'' each take values ±1. Let this be repeated 4 
times: Two settings a, a' at site A are combined with two settings b, b' at site 
B. The CHSH version of Bell's inequality then reads: 

E(A"/i'') < E(r A''') + E(r'/i'') + E(A"'/i''') + 2. (4) 

In fact we can easily show the seemingly stronger statement: 

A'A^' + A"'/t^ + A"'/i^' - A°/i^ = ±2 (5) 

whenever the A-estimates take the values ±1: All the products take values ±1 
and A°/i'' is the same as the product of the first three similar terms. Listing the 
possibilities of signs here, then shows that the lefthand side of (O always equals 
±2. 

As is well known, the inequality I0J) can be violated in the quantumme- 
chanical case, and this is also well documented experimentally. There is a 
large literature on Bell's inequality. In recent years there has been a discus- 
sion |29[ 1301 1^ on whether or not it is possible to break the inequality by a 
computer experiment. Various possible positions that may be held on the vi- 
olation of the inequality are discussed in |32) . One such position is that there 
always will be a loophole in real experiments [33] such that the experimental 
violation still can be explaned by a local realistic model. 

The following is an important part of our philosophy: Quantum theory 
is a statistical theory, and should be interpreted as such. In that sense the 
comparison to a classical mechanical world picture, and the term 'local realism' 
enherited from this comparison is not necessarily of interest. We are more 
interested in the comparison of ordinary statistical theory and quantum theory. 
Our aim is that it in principle should be possible to describe both by essentially 
similar ways of modelling and inference. Thus it is crucial for us is to comment 
on the transition from Q to from this point of view. 

As pointed out by |31j . for any way that the experiment is modelled by 
replacing the spin measurements by random variables, there is no doubt that 
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this transition is valid, and the inequaHty must necessarily hold. The reason 
is simple: The expectation operator E is the same everywhere. 

Now take a general statistical inference point of view on any situation that 
might lead to statements like and Q. Then one must be prepared to take 
into account the fact that there is really 4 different experiments involved in these 
(in)equalities. The A's and /t's are random variables, but they are also connected 
to statistical inference in these experiments. What we know at the outset in 
the EPR situation is only that some metaparameter zLcj) (possibly together with 
other metaparameter-components) is involved in each experiment. Going from 
this to the observations, there are really three steps involved at each node: The 
components 9{(j)) are selected, there is a model reduction A = 77(0), and finally 
an observation A. Briefly: A model is picked, and there is an estimation within 
that model. 

10 Statistical models in connection to Bell's in- 
equality. 

Turn to general statistical theory: According to the socalled conditionality prin- 
ciple |34j . a principle on which there seems to be a fair amount of concensus 
among statisticians, inference in each experiment should always be conditional 
upon the experiment actually performed. 

A motivating example for this is the following, due to Cox [3S]: Let one 
have the choice between two measurements related to a parameter 9, one having 
probability density fi{y,0), and the other having probability density f^iy^d). 
Assume that this choice is done by throwing a coin. Then the joint distribution 
of the coin result z and the measurement y is given by 

c(z)h{y,e) + {l^c{z))h{y,e), 

where c(z) = 1 if model 1 is chosen, otherwise c{z) = 0. Should this joint dis- 
tribution be used for inference? No, says Cox and common sense: All inference 
should be conditional upon z. 

In particular then, the conditionality principle should apply to the distribu- 
tion of point estimators. Taking this into account, it may be argued that at 
least under some circumstances also in the macroscopic case, different expecta- 
tions should be used in a complicated enough situation corresponding to 
and then the transition from ||SJ to is not necessarily valid. 

This is dependent upon one crucial point, as seen from the conditionality 
principle as formulated above: When one has the choice between two experi- 
ments, the same parameter should be used in both. How can one satisfy this 
requirement, say, in the choice between a measurement at a or at a'? As for- 
mulated above, the relevant parameters are A° and A" for the two experiments 
under choice. 

Here is one way to give a solution: Focus on the Stcrn-Gerlach apparatus 
which measures the spin. Make a fixed convention on how the measurement 
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apparatus is moved from one location to the other. Then define a new parameter 
A which is -1 at one end of the apparatus and -1 at the other end. By using 
A as a common parameter fo both experiments under choice, the conditionaUty 
principle can be applied, and I^J does not follow from |SJ|. 

As I see it, this argument can be related to the socalled chameleon effect, 
which is advocated in several papers [221 EOl ISHI by Accardi. From this point 
of view the effect may look rather simple and uncontroversial, but note that it 
here is coupled to a rather deep general principle of statistics. 

The analysis above took as a point of departure a reduced model, where 
the model reduction was related to the rotational group. If space and time are 
included, more parameters must be introduced, and the full Poincare group or 
its non-relativistic relative is needed. Under this assumption it was shown by 
Volovich 37 that local realism is restored. According to the present discussion, 
this might be interpreted to mean that, when space and time are included among 
the parameters, there is no chameleon trick which can construct a common set 
of parameters for the two experiments under choice. A more detailed discussion 
of this and of the related loophole theme |32[ \'6'6\ 05] is beyond the scope of the 
present paper. 

My crucial point is that the violation of the Bell inequality is not by necessity 
a phenomenon that makes the quantum world completely different from the rest 
of the world as we know it. Regarding the term 'local realistic', I don't mean to 
imply that any macroscopic phenomenon are nonlocal if this term is relevant. 
But if 'realistic' means that a phenomenon always can be described by one single 
model, this may be a too strong requirement. 

A more explicit argument for the correlation between spin measurements, 
using the prior at A connected to model reduction there, may be given as follows: 
At the outset the (meta)parameter cj) is sent to A and —cj) to B. This may be 
interpreted to mean that much common information is shared between the two 
places. The vector </> is capable of providing an answer to any question a € A: 
Is the spin in direction a equal to +1 or to -1? 

The observer at A will have a prior on (j) given by a probability 1/2 on 
A" = +1 and a probability 1/2 on A° = —1, where 0" is the cosinus of the angle 
between a and 0, and A" the corresponding reduced parameter taking values 
±1. This is equivalent to some prior on the vector (f) which has probability 
1/2 of being a + e and 1/2 of being —a + e, where a is a unit vector, and e is 
some random vector perpendicular to a which is independent of A° and has a 
uniformly distributed direction. Note that this reasonable prior on (j) is found 
by just making the decision to do a measurement in the direction a at A. 

Now let one decide to make a measurement in the direction b at the site B. 
Let 6^ be a unit vector in the plane determined by a and b, perpendicular to 
b. Then, taking the prior at A as just mentioned, (f> will be concentrated on 
a + e = bcos{u) + b^sm{u) + e and —a + e, where u is the angle between a and b. 
Hence the component of this prior for —(j) along b will be — A''cos(u) — e-b, where 
the first term takes two opposite values ±cos(m) with equal probability. The 
expectation of this prior component will be 0, more specifically, the component 
will have a symmetric distribution around 0. 
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Conditionally, given A°, this prior component will have an unsymmetric 
distribution, and there is a uniquely distributed parameter (x^ taking values ±1 
such that E(yu''|A°) — — A"cos(m). So, using parameter reduction to ±1 at B, 
this is the distribution obtained from the model assuming a measurement in 
direction a at A. There is noe action at a distance here; all information is in 
principle contained in the measurement (/>. 

Turning now to estimation, in general an unbiased estimator in statistics is 
an observator, i.e., random variable whose expectation equals the parameter in 
question. Let now A" and jl^ be unbiased estimators of A° and respectively, 
so that E(A''|A") = A" and = Later we shall show the existence 

under reasonable assumptions of such estimators taking the correct values ±1. 
Then 

E(A'^A'') = miX'^^'m = E(E(>|A-)E(E(A''|/)|A"))) . 
= E(A°(-A'^cos(u))) -cos(m). 

This correlation also determines the joint distribution of the two random 
variables A" and //''. 

The discussion above was partly heuristic, but it leads to the correct an- 
swer, and it seems to be a way to interprete the information contained in the 
metaparameter 0. 

It is also important that the above discussion was in terms of a reasonable 
parametric model. Parameters are distinctly different from random variables, in 
particular from random variables located in time and space. Much of our daily 
life imply the use of mental models, and also some form of model simplification. 
Quantum theory can in some sense be said to have analogies also to this world, 
perhaps more than to the world of classical mechanics. 

The limitation of the way of thinking demonstrated in this section is twofold: 
First, the basic group need not be the rotation group in general. Secondly, it 
may not be strightforward to generalize th reasoning to the case with more than 
two eigenvectors. Hence we wil start to build up the apparatus which we feel is 
necessary to treat more general cases. Ultimately, it will need to the ordinary 
formalism of quantum theory. 

11 Group representation and invariant spaces. 

We assume the basic elements of group representation theory to be known; 
for simple treatments with physical applications see [391 14U) , a mathematical 
treatment of finite groups is given in 41 and more advanced discussions are 
found in j42[ 143) . As is well known, group representation is a very useful tool 
in applications of quantum mechanics. Here, the formal apparatus of quantum 
mechanics will be partly derived by considering these representations. A group 
representation is a homomorphism of a group onto the transformations on some 
vector space. In simple cases one may think of the latter as a group of matrices 
under multiplication. It is assumed that the vector space will be invariant 
under these transformations. Note then that much of the statements connected 
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to group representation will have an analogy in the group itself, looked upon as 
simply a transformation on a (parameter) space. 

Specifically, the regular representation /7(G) on L^{^, v)^ where v \s a. right 
invariant measure for the basic group G is given by 

U{g)f{^) - fm. (7) 

Explicitly this implies that U{G) is a group of unitary linear operators acting 
on L^(<I>, v). The group property of U{G) is well known and easily verified. The 
same formula {TJ is valid for any subspace F of L^($, ^) which is invariant under 
the group of operators U{G), i.e., such that U{g)f S V when f GV and g & G. 
Also, there is a natural homomorphism from G to U{G) given by g ^ U{g): 

U{gi)U{g2)f{c^) = C/(gi)/(0g2) = /(0<?ig2) = C/(ffi52)/(0). (8) 

This means that G and U (G) have similar structures, which is the first basic fact 
that leads from a general group to the formalism of linear operators so familiar in 
quantum mechanics. All calculations in quantum mechanics are currently done 
on the operator side. As is just indicated, looking at the parameter space and the 
group actions defined there can sometimes lead to a more direct understanding 
of the same phenomena. 

For some fixed a S let now d°'{-) be a subparameter (defined on the 
parameter space or metaparameter space $) which is natural with respect to a 
subgroup G". Let 

= If e L2($, j/) : f{cb) = ne^icb)) for some /}. (9) 

This is obviously a closed subspace of L^($,i/). Furthermore (by the property 
of naturalness) it is invariant under the group of operators U{G°'). 

Alternatively, everything can be reduced to functions of O"^: Look at the 
space V"" = L^{Q°-,D°-) with the operators U{G°-), where g £ G°- \s defined by 
{9°'g){(p) — 6°'{(j)g), where D"' is the invariant measure on 8° induced by ly on 

and U operates on functions f{9) by U{g)f{9) — f{9g). This means that we 
have a sequence of homomorphisms / isomorphisms 

G" ^ G" ^ U{G''){onV'') ^ [/(G"). (10) 

Sometimes a parameter 6^ (natural with respect to G^) will be a function of 
a natural parameter 0" (natural with respect to G"). This fact is equivalent to 
the fact that the corresponding invariant spaces satisfy C V^. In particular 
the space will be the same under any one-to-one reparametrization. Also in 
particular, if A" = ri{d'^) is a model reduction then 

= {/ : /(0) = /(A"(0))} C (11) 

We will call y and parametric invariant subspaces of i^(<&, i^)- Restrict- 
ing group representation to these invariant spaces correspond to first going from 
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the mctaparamctcr (j) to the relevant subparametcr O"', and then to the reduced 
parameter A"*. This gives a relationship between vector spaces on one hand and 
parameters on the other hand. There is a similar relation between group repre- 
sentations (acting on vector spaces) and group actions (on parameter spaces). 

12 Experiment, model reduction and group rep- 
resentation. 

Let now the experimentalist have the choice between different experiments a G A 
on the same unit(s), where the experiment a consists of measuring some y°', 
with J/" = ?/"(w) being a function on some common sample space S, and where 
the measurement process at the outset is modelled with a parameter 9'^. This 
parameter is a part of the model-description of the units, and all the model 
parameters may be seen as functions 0°'{(p) of a (meta) parameter (p. It must be 
emphasized that the mctaparametcr here is only a modelling concept. In ordi- 
nary statistical theory one usually imagines a situation where the model applies 
to a number n of identical units, and one then is free to let n tend to infinity. 
Then it is obvious that every parameter must be imagined to 'have a value'. 
Concretely, this means that the parameter is estimable according to Definition 
1. In the quantummechanical situation we have one or a few units, and the 
(meta)paramcter is explicitly connected to these units. For the latter situation 
it does not then necessarily make sense to let every theoretical (meta)parameter 
'have a value'. This is of course consistent with the Kochen-Specker theorem. 
But note that (f) plays a crucial role in the conceptual description of the situation. 

We use a common sample space S for all experiments a, since this space can 
be imagined in terms of a common measurement apparatus (or apparata). For 
convenience, we will fix one probability measure P on the sample space S. Each 
model induces a new set of probability measures P'' . These probability models 
for the observation may depend on the way the experiment is performed. But 
the parameters 6'^ (and A° below) are assumed to be the same regardless of the 
way the experiment o G ^ is performed. 

In the examples above, we had a situation where the experimental param- 
eters 0°(-) were non-natural with respect to the original group G. As argued 
above, the non-naturalness means that the symmetry group G on the parameter 
space - for the purpose of this particular experiment - must be replaced by a 
subgroup G", typically different for different a. One can show that there always 
for each a exists a maximal such subgroup. Since this is a proper sub-group, 
G" cannot be transitive on the (p-space, nor then the derived group G" on 6'^. 
This then gives us the possibility of a parameter reduction - if this is needed - 
which is done by selecting one orbit or some set of orbits of this group. Such a 
parameter reduction will always be natural (with respect to (5", and then also 
with respect to G"). In general, let \'^{cf)) be the reduced parameter. Since the 
model reduction is done by orbit selection, the same group symbol G" can be 
used for the group acting on its range A". 
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We will shortly consider group representation spaces of the group G". The 
following argument shows that model reduction through orbit selection gives 
a simple transition from V"' to V^, i.e., from the parameter 9"" to the reduced 
parameter A". This gives a new model P"^ , constructed from the original model 

Every function of a parameter 0° can be written as a sum of functions 
of the 0°-parts restricted to the orbits as follows. For orbit Of define fi by 
= fiO") when 9" € Of, otherwise f,{9'') = 0. Then 

i 

But the set of functions of 6'"-parts belonging to orbits is invariant under the 
relevant group G°, hence this implies a splitting into invariant spaces. 

From this, the sum of subrepresentations in question corresponds to a se- 
lected union of orbits, which again corresponds to a selected reduced parameter 
A". The statistical model with this reduced parameter will now be introduced. 

13 Experimental basis and the Hilbert space of 
a single experiment. 

Up to now the discussion has been in terms of models and abstract parameters. 
Now we introduce observations in more detail. We have already stressed that 
we in a given situation have a choice between different experiments/ questions 
a. In this section we will fix a, and hence fix the reduced parametric function 
A° (</>). Given a measurement instument, this will lead to a reduced model . 
We will make some specific requirements - not too strong - on these models 
shortly. The sample space for all experiments will be called S, so that P"** is a 
measure on S. 

In this section we will need to introduce some statistical concepts; for a more 
thorough treatment, see, e.g., |22j. 

A random variables containing all the information of relevance to the partic- 
ular experiment a, is called a sufficient random variable for this experiment, or 
a sufficient observator. The concept of sufficiency has proved to be very useful 
in statistics. Precisely, we have the following 

Definition 3 

A random variable t"" = i"(a;);a; G S connected to a model is called 
sujficient if the conditional distribution of each other variable y, given is 
independent of the parameter A" . 

This means that all information about the parameter is contained in t"" . In 
general, t"' will be a vector variable. A sufficient observator (random variable) 
is minimal if every other sufficient observator is a function of t"". It is complete 
if 

E^°(/i(t'')) = for all A'^ implies hit") = 0. (12) 
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It is well known that a minimal sufficient statistics always exists and is 
unique except for invertible transformations, and that every complete sufficient 
statistics is minimal. If the statistical model has a density belonging to an 
exponential class 

and if c(0) — {c{6) : 6* e 9} contains some open set, then the statistics is 
complete sufficient. 

Recall from Definition 1 that a function ^(A°) is called unbiasedly estimable 
if E'^" {y) = ^(A") for some y. Given a complete sufficient statistics every 
unbiasedly estimable function ^(A°) has one and only one unbiased estimator 
that is a function of t" . This is the unique unbiased estimator with minimum 
risk under weak conditions j^. Thus complete sufficiency leads to efficient 
estimation. 

Definition 4. 

Assume that a complete sufficient observator t"" exists under the model P"^ . 
Let the model he dominated, i.e., such that all are absolutely continuous 
with respect to a common measure P. Then the Hilbert space K,"" is defined as 
consisting of all functions h{t°') such that h{t°') G L^{S,V). 

Let then G be the group acting upon the sample space S. 

Proposition 1. 

Each space /C° is an invariant space for the regular representation of the 
observation group G. 

Proof. If is sufficient under the model P'^ , and G is the group on the 
sample space, then f^g given by t°'g{Ld) = t"'{ujg) is sufficient for all g G G. This 
is proved by a simple excercise using below. Also, if i° is complete, then f^g 
must be complete; hence the two must be equivalent. Therefore /C° is invariant 
under G. 

Consider now the operator A° from L^{S,P) to V° C i')) defined by 

(A'^y)(A'^(0)) = J P^°(*)(da;)y(^) = E^°(*)(y), (13) 

using again the reduced model P^ (duj) corresponding to the experiment a. 
Definition 5. 

Define the space W by W ^ A^JC". 

By the definition of a complete sufficient observator, the operator A° will 
have a trivial kernel as a mapping from /C° onto A°'JC°' . Hence this mapping is 
one-to-one. It is also continuous and has a continuous inverse. Hence is a 
closed subspace of L^($,j/), and therefore a Hilbert space. Note also that W 
is the space of unbiasedly estimable functions with estimators in 1?{S, P). It is 
of course included in the space of all functions of the parameter A". 
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Proposition 2. 

The space is an invariant space for the regular representation of the group 

G". 

Proof. 

Assume that ^{X"") = (y) is unbiasedly estimable. Then also r]{X°-) = 
£,{X'^g) = E"^ ^{y) = E'*' {yg~^) is ubiased estimable, so is an invariant space 
under the regular representation U of G". 

Theorem 1. 

Assume that the spaces /C° and W are finite- dimensional. Then these two 
spaces are unitarily related. Also, the regular representations of the groups G 
and G° on these spaces are unitarily related. 

Proof. 

We will show that the mapping A"" can be replaced by a unitary map in the 
relation W = A'^JC". 

Recall that the connection g g from the observation group to the param- 
eter group G"" is given from the reduced model by 

P-^'"9(B) =P^°(Bg-i). (14) 

For g € G and g € G"^ define Ui{g) — tj{g) as operators on when g ^ g 
as in (|14|) . Here tj is the regular representation of the group G". Then it is 
easy to verify that J7i is a representation of G. Also, if Vi is an invariant space 
for Ui, then it is also an invariant space for JJ . However, the space Vi is not 
necessarily irreducible for U even if it is irreducible for Ui . 

Using the definition and the connection (|14|l between g and g we find 
the following relationships. We assume that the random variable y(-) belongs to 
/C° C L^(5', P) and that tj is chosen as a representation on the invariant space 
W. Then 

C/i(g)A-y(A-) = U {g) A" y = 1 2/(t^)P^°^(rfc.) 
= / y(a;)P^° {du;g-') = / y{Ljg)P^' {du;) = A-U{g)y{X-), ^'^^ 

where U is the representation on /C° given by Uy{uj) = y{Lug), i.e., the regular 
representation on L^(5, P) restricted to this space. 

Thus Uiig)A'' = A'^Uig) on H". 

Furthermore 

U{g) = U{g) = UM = A''U{g)A'' when g ^ g and g ^ g. 

Recall that g ^ g va this setting if {X°'g){(f)) = X°'{(f)g), and that U{g) — U{g) in 
this case. Furthermore, U{g)f{(j)) — f{4>g) when / G and g G G'^. 

By |43) p. 48 for the finite-dimensional case, if two representations of a group 
are equivalent, they are unitary equivalent; hence for some unitary G" we have 

U{g) = G''U{g)G'' t (ig) 
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when g ~> g. 

Since the unitary operators in this proof are defined on and W, respec- 
tively, it foUows that these spaces are related by W = C/C". 

From a statistical point of view it is very satisfactory that the sufficient 
statistic determines the Hilbert space for single experiments. The sufficiency 
principle, by many considered to be one of the backbones of statistical inference 
(e.g. 133] ) says that identical conclusions should be drawn from all sets of 
observations with the same sufficient statistics. It is also of importance that 
this Hilbert space satisfies the invariance properties that are needed in order 
that it can serve as a representation space for the symmetry groups connected 
to each experiment. 

Definition 4 may also be coupled to the operator A"^ and to an arbritary 
Hilbert space K,' of sufficient observators, which may trivially be the whole 
space L^{S,P). Let first 



Then /C" may be considered as the factor space K,' /L"^, i.e., the equivalence 
classes of the old K,' with respect to the linear subspace L" (cf [32], 1.2.101V). 

Here is a proof of this fact: Let ^ e A°-JC' , such that ^(A") = E'*' (y) for some 
y £ /C'. Then y is an unbiased estimator of the function ^(A"). By , Lemma 
1.10, C(A°) has one and only one unbiased estimator which is a function h{t) of 
t. Then every unbiased estimator of C(A°) is of the form y = h{t) + x, where 
a; e L"; this constitutes an equivalence class. On the other hand, every h(t) can 
be taken as such a y. 

14 The quant umtheoretical Hilbert space. 

Our task in this section is to tie the spaces together. We have already as- 
sumed that all the different potential experiments a & A can be tied to one single 
observational space. This is a basic assumption in many macroscopic situations 
also; say the case where one must choose one of several potential treatments 
for one patient. In quantum mechanics one must assume an experimental set- 
ting such that a limitation of the complete sufficient statistics for experiment a 
makes the observational Hilbert space /C" non-trivial. If these sufficient statis- 
tics should be related in some sense, this would mean intuitively that we have 
limited resources in the same way for the different experiments. 

What we assume here, is that the reduced parameter spaces of the different 
experiments have a similar structure. Then the corresponding groups G" can 
be expected to be isomorphic. Precisely, we will assume an inner isomorphism 
as follows: 

Assumption 1. 

Let a, h be any pair of experiments. Assume then that there exists a group 
element gab G G such that the isomorphism between G° and is given by 



^{yelC' : E^°y = for all A"}. 



(17) 



= gatg'^gab- 



(18) 
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Here are some examples where this assumption is satisfied: 

1) Let $ be the real line, let G be the reflection and translation group 
on $, and let g°' be the reflection around a e which together with the 
identity constitutes the subgroup G". Then H18|) holds if gab is the translation 
X ^ X + {h — a). 

2) In the spin 1/2 case $ was a space of vectors, G was the rotation group, 
and G° was the subgroup of rotations around the axis a together with a reflection 
around any axis perpendicular to a. Then (|18|l holds if gab is any rotation 
transforming a to h. 

3) (See 20], p. 24) Let $ be spacetime {^i, ^^2, Csj '''}■ A Lorentz boost in 
the ^ direction with velocity v is given by the transformation (|29|l below. Call 
this tranformation group element g"" , and let g^''^'°' be the corresponding boost 
taking the space time point (77, cr) as an origin instead of (0,0). Let h^''^ be 
a translation in spacetime by the amount {rj^a). Then if (?7,cr) = {^,T)g'", we 
have the three relations 

4) If H18|) holds for transformations on some component spaces, it also holds 
for the cartesian product of these spaces. 

Assumptions 1 will be crucial in connecting the Hilbert spaces Ti" for the 
different experiments. 

From (|18(l follows that any representation U of the basic group G satisfies 

U{gn = UigabWig'Wigaby. (19) 

In particular, this is true for the regular representation U, which satisfies U (g") = 
i7(5") on W and U{g'') = U(f) on H^ 

Since and T-C' are invariant spaces for these respective representations by 
Proposition 2, it follows that we can construct a connection between the spaces 

by 

W = U{gab)n\ (20) 

It follows from (^0)1 : 
Theorem 2. 

a) There is a Hilbert space Ti, and for each a a unitary transformation 
such that = D^n. 

h) There are unitary transformations E'^ such that the observational Hilbert 
spaces satisfy K,"' — E°'TL. 

Proof. 

a) Obvious from (PH]) . 
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b) Prom a) and Theorem 1. 



Theorem 3. 

TL is an invariant space for a representation of the whole group G. 

It follows from Proposition 2 that is an invariant space for the group G°, 
hence for G". (Remember that Uig") = Uig").) 

This can now be extended. Assume that g = 515253, where 51 G G", 52 S G^ 
and 53 G Then 



gives a representation on TC of the set of elements in G that can be written as 
a product 515253 with 51 G G", 52 G G'' and 53 G G°. 

Continuing in this way, using the assumption that the group G is generated 
by {G"; a G A} wc arc able to construct a representation of the whole group G 
on the space Ti. In particular, one must be able to take Ti as an invariant space 
for a representation of this group. 

As an example, the Hilbert space of a particle with spin is always an (ir- 
reducible) invariant space for the rotation group. Together with the fact that 
7i should be an invariant space for the sample space group G, this to a large 
extent determines H, at least if the experimental setting forces H to be as small 
as possible. 

15 Operators and states. 

So, by what has just been proved, for each a the Hilbert space of unbiasedly 
estimable functions of A" can be put in unitary corresponance with a common 
Hilbert space H. We will assume that H is finite-dimensional. Then H" = D^H 
is also finite-dimensional, that is, the space of unbiasedly estimable functions of 
the parameter A" has finite dimension. In this Section we will make a seemingly 
even stronger assumption, namely that the reduced parameter A" itself only 
takes a finite set of values. For simplicity we consider only the case where A" is 
a scalar parameter. 

Recall that W = D"H is a space of functions of the parameter A". Define 
in general an operator S"' on this space by 



for functions / such that the function on the righthand side belongs to H". 
The corresponding operator on H will then be defined by 



Now S"' has eigenvalues A^ with corresponding eigenfunction given by the 
rather trivial function which equals 1 when A° = A^, otherwise 0. This 
implies the important consequence that T" also has eigenvalues A^ with some 



S^fiX") = A"/(A") 



(21) 




(22) 
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eigenvector v^. We will here and in the following for simplicity assume non- 
degenerate eigenvalues. 

Note that is just the indicator for the statement A" = A^. Transferred to 
another space, this means that the eigenvectors v'^ of Ti. also can be interpreted 
as an indicator of the same statement. Thus these vectors can be given the 
following interpretation: A question a £ A has been asked, and the answer is 
given by A'' = A^. This is consistent with the well known quantummechanical 
interpretation of a state vector. 

To follow up, a natural conjecture of the present paper is that, since all 
vectors in 7i are eigenvectors of some operator, all pure 'states', expressed in 
quantum theory as such vectors, can be given an interpretation of this kind. 

The operator T°- may be written 

n 

T'^ = Y.^»t^- (23) 

fc=i 

If A° is multidimensional, a similar statement holds for multivariate opera- 
tors and multidimensional eigenvalues. 

Theorem 4. 

Under the assumptions above, the space W of unbiasedly estimable functions 
is equal to the space of estimable functions. 

Proof 

If T'* is an operator of the form then ^(T"^) = ELi^(-^fcX"fc ^ is 

also a valid operator for any function ^. Thus £,{3'^), having eigenvalues ^(A^) 
also has eigenvectors in Ti" which are functions of A". This means that ^(A") is 
unbiasedly estimable for any £. 

More on the relationship between the foundation of quantum mechanics on 
the one hand and the group representations and their operators on the other 
hand, can be found in [TUj . 

An important question, probably requiring deeper mathematical tools than 
what has been used here, is to generalize the results of this paper to Hilbert 
spaces of infinite dimension. 

16 Born formula. 

To complete deriving the formalism of quantum mechanics from the statistical 
parameter approach the most important task left is to arrive at the Born formula 

P(A'' = A,^|A'' = A^)-KMP. (24) 

Note that here A" and A** are connected to two different experiments on the 
same unit(s). The interpretation of H24|l is as follows: Assume that the system 
is in the state where the reduced parameter corresponding to A" is equal to A^. 
Then imagine that we will perform an experiment whose outcome y depends on 
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the the parameter A**. The hmited experimental basis again enforces a reduced 
model, and in an ideal experiment, where observation almost equals parameter, 
the probability distribution according to quantum mechanics is given by (|24|) . 
where and v^^ are the corresponding eigenvectors. To simplify the discussion, 
we have assumed non-degenerate eigenvalues. 

The proof of (|24|l may in some respect be related to that of Gleason's theorem 
|45|: for a formulation see also for instance 021 ■ The derivation of a probability 
law from Gleason's theorem has been argued for by jJT], but note that this 
theorem departs from a different set of assumptions. 

A more direct approach to (|24l) using decision theory has been given by 
Deutsch (48j. Deutsch started by making reasonable assumptions about how a 
rational decision maker should behave, and then proceded from simple games to 
more complicated situations. Several of his arguments were heuristic. On this 
background the paper has been criticized by Gill 0^1 who added three assump- 
tions: Degeneracy in eigenstates, functional invariance and unitary invariance. 
He conjectured that Born's formula can be proved under these assumptions. The 
assumptions of Gill are satisfied by the theory of the present paper. For instance 
the theory, including the definition H23() is invariant under the transformations 
T°- U'^T°-U and U UH for some fixed unitary U. 

The paper by Deutch ,48j has also been criticized by Finkelstein [30] . Further 
developments are given in The final solution is still wanting. 

17 Basis of quantum mechanics and link to quan- 
tum statistics. 

Our state concept may now be summarized as follows: To the state A''(-) — 
there corresponds the state vector , and these vectors determine the transition 
probabilities as in H24|l . This also implies 

EiX^lX^^XD^vl^T^vl (25) 

where T'' = J2 ■ Similarly 

E(/(A^)|A'^ = A^)=t,f/(r''K, (26) 

where f{T^) = X] ■ Thus the expectation of every observable in any 

state is given by the familiar formula. 

It follows from (|25|1 and from the preceding discussion that the first three 
rules of |46| . p. 71, taken there as a basis for quantum mechanics, are satisfied. 
The 4th rule, the Schrodinger equation, will be discussed below. The present 
approach may also be related to the seven principles of quantum mechanics put 
forward by Volovich j53| . but this will require some further developments. 

In ordinary statistics, a measurement is a probability measure P^{dy) de- 
pending upon a parameter 9. Assume now that such a measurement depends 
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upon the parameter A''(-), while the current state is given by A°(-) = A^. Then 
as in H26|l . for each element dy there exist an operator M{dy) such that 

V[dy\X^ = \l] = vl}M{dy)vt namely M{dy) = ^ P^' (dy)i;^^f . 

j 

As is easily checked, these operators satisfy M[S] = 1 for the whole sample space 
5", and furthermore ^M(Ai) = M{A) for any finite or countable sequence of 
disjoint elements {Ai, A2, . . .} with A ~ UiAi. 

A more general state assumption is a Bayesian one corresponding to this 
setting: Let the current state be given by probabilities 7r(A^) for different values 
of A^. Then, defining a^J2 <K)vtvl\ we get 

F[dy]^tr[aMidy)]. 

This is the basis for much of quantum theory, in particular for the quantum 
statistical inference in |31J; for a formulation, see also |46|. 

18 Isolated systems. 

We now have a rather complete description of a large class of isolated systems. 
These systems can be in a state described by a state vector v'^ in a Hilbert space 
7i, which means that an ideal experiment a € A has been performed with the 
result expressed in terms of the (reduced) parameter as A° = AJ?. An alternative 
interpretation: // experiment a should be performed, the result will be A° — A^ 
with certainty. 

We assume that every state of a completely isolated system is such an eigen- 
state, meaning that it is equivalent to some statement A"^ = A^. 

This may be taken as consistent with the following empirical fact: For real 
quantummechanical systems, all states are eigenstates for variables that are 
absolutely conserved, i.e. charge and mass. Linear combinations of such state 
vectors do not correspond to anything in reality, the well known phenomenon 
of superselection rules. 

Also, when a system has an absolute symmetry g, i.e., identical particles 
under permutation, then the state vector has the corresponding symmetry, that 
is U{g)v'^ = w^. This seems to be related to the Bose- Einstein statistics and the 
Fermi-Dirac statistics. 

The state vector is only defined modulo a phase factor. This can be related to 
a non-trivial stability group for the group G governing the system. In particular, 
this implies the following: Assume that there are parameters ^ that are constant 
under the actions of the group G. Then each state vector describes the same 
state as any vector of the form exp[iF(^)]v^. 

Constructing the joint state vector for a system consisting of several partial 
systems, with symmetries only within the partial systems, follows the receipt 
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19 The Lorentz transformation and Planck's con- 
stant. 



We continue to insist upon keeping the distinction between ideal values of vari- 
ables, that is, parameters on the one hand, and observed values on the other 
hand. In the statistical traditions we will continue to denote the former by 
greek letters. Hence let (■Cij Cs) be the ideal coordinates of a particle at time 
r, and let (tti, 772, tts) be the (ideal) momentum vector and e the (ideal) energy. 
In this section we will not speak explicitly about observations. Nevertheless it 
is important to be reminded of the premise that these quantities are theoretical, 
and that each single of them can only be given a concrete value through some 
given observational scheme. 

This is a general way of thinking which also seemingly may serve to clarify 
some of the paradoxes of quantum theory. As an example, look at the Einstein, 
Podolsky, Rosen 1 situation in its original form: Two particles have position 

and momentum tt* (i ~ 1,2). Since the corresponding quantum operators 
commute, it is in principle possible to have a state where both — and tt^ +Tr'^ 
are accurately determined. That implies that a measurement of respectively 
TT^ at the same time gives us accurate information on respectively tt^. We 
have a free choice of which measurement to make at the 1-particle, but that 
does not mean that this choice in any way influences anything at the 2-particle. 
It only influences which information we extract about this particle. 

After this digression we continue with the single particle situation. As is 
well known from special relativity, the four- vectors ^ — (^17^2,^3,^0 = cr) 
and TT = (tti , , , TTg = c^^e) transform according to the extended Lorentz 
transformation, the Poincare transformation, which is the group which fixes 
c'^drg = c^dr'^ - '^^^ respectively c^mg = c'^e^ - Yf^=i T^f- This is a 

group of static linear orthogonal transformations of vectors together with the 
transformation between coordinate frames having a velocity v with respect to 
each other. Specifically, the coordinate vectors transform according to an inho- 
mogeneous transformation ^ — > -I- 6, while the momentum vector transforms 
according to the corresponding homogeneous transformation tt — > At:. The 
group might be a natural transformation group to link to the eightdimensional 
parameter (p — (1^1, ^2, Cs, ""i, "'27 ""a, c), associated with a particle at some time 
r. However, since the static rotations have representations associated with an- 
gular momenta already briefly discussed, we limit ourselves here to the group 
G of translations together with the pure Lorentz group. 

Consider then the groups Bj given for 5^ e Bj by ^^g^ = + b, other 
coordinates constant, and the groups Vj given by Lorentz boosts of some size v 
in the direction of the coordinate axis of for j — 1,2,3 together with the time 
translation group Bq given by rgQ = t + t. These groups generate G, and they 
are all abelian. Furthermore, the groups Bj commute among themselves, the 
groups Vj commute among themselves, and, since lengths perpendicular to the 
direction of the Lorentz boost are conserved, Bj commute with Vk when j ^ k. 
Finally, the elements of the group Bq commutes with those of Bj {j > 1), but 
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not with those oi Vj (j = 1, 2, 3). 

Disregardmg the time translation group for a moment, it is left to consider, 
say, the groups Bi and Vi together. As is easily seen from the formula, these 
do not commute. The simplest one is which only affects the coordinate ^i. 
Hence is trivially natural with respect to this group. 

From the form of the Lorentz transformation 

and correspondingly for (7ri,e), we see that and tti are not natural when t, 
respectively e are variable. The linear combinations ~ ct, i^i + ct, tti — c^^e 
and TTi + c~^e are natural. One could conjecture that these facts could be useful 
in a relativistic quantum mechanics, but this will not be pursued here. 

Furthermore, — {f : f{(f)) — (</))) for some q} is a subspace of 
L^(<I>, v) which is invariant under the group G\. The representations have the 
form C/i(5)g(ei) = 9(^15) = q{^i + b). But 

°° hk nfc a 

^ fc! d^'i; d^i n 

where Pi is the familiar momentum operator 

h d 

Thus the particular group formulated above has a Lie group representation 
on an invariant space with a generator equal to the corresponding momentum 
operator of quantum mechanics. The proportionality constant h can be argued 
to be the same for all momentum components (and energy) by the conservation 
of the 4- vector. By similarly considering systems of particles one can argue that 
?i is a universal constant. 

In particular then, time translation r ^ r + t has a representation 

exp(^), (28) 

where H is the Hamiltonian operator. 

All these operators can be connected to the fact that that the group G as 
defined above is in fact additive on the right scale. In JO! is pointed out that 
the Lorentz transformation H27|) is equivalent to 

^1 —^ ^icoshr^, + CTsinhr„, ct — > ^isinhr^, + crcoshr„, (29) 

where the rapidity r„ is defined by tanhr„ — v/c. This makes the Lorentz 
boost additive in the rapidity, and all relevant operators and their commuta- 
tion relations can be derived. In particular, the familiar commutation relation 
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XiPi — PiXi — ihl (with Xi being the operator corresponding to position ^i) 
holds under the approximation w v/c. 

The corresponding commutation relation between the time operator and the 
energy operator has also been derived by Tj0stheim |55j in a stochastic process 
setting using just classical concepts. 

Starting from these commutation relations, other representations of this 
Heisenberg-Weil group are discussed in 56 . 

Note that the groups G\ and G\ are transitive in this case, so there is no 
need for - or possibility of - a model reduction. 

In the nonrelativistic approximation, and tti are natural. The basis vectors 
of the Hilbert space for position and basis vectors of the Hilbert space for 
momentum tti are connected by a unitary transformation of the form 

u-(^i) = -1= / exp(^)^.^(a)del. 

The parameters and tti can be estimated by making observations. It is 
natural to impose the translation/ Lorentz group upon these measurements. 
Thus the requirement that the basic Hilbert space also should be a representa- 
tion space for the observation group, is obviously satisfied in this case. 



20 Time development. Schrodinger equation. 

In Section 19 we showed that in the case of a single particle, the time translation 
T ^ T + t had the group representation 

exp( — ), 

where H is the Hamiltonian operator. This can be generalized to systems of 
several particles using an assumption of additive Hamiltonian, and assuming 
that the particles at some point of time were pairwise in contact, or at least so 
close with respect to space and velocity that relativistic time scale differences 
can be neglected. 

Assume further that at time a maximal measurement is done, so that the 
system is in some state vq € Ti.. This means, according to our interpretation, 
that some experiment with reduced parameter A'^ has been done, resulting in 
a value A°. The construction of the Hilbert space H was carried out in the 
Sections 13-14; the starting point was /C^, then = A'^/C", where A" was 
given by 

AVA")=E^"(y). 

In particular, the vector vq corresponds to some vector wq in by some fixed 
unitary transformation, then to uq — A"'wq G A"''H"'. 

Consider now the time translation group element with step t, and assume 
that A" transforms under this group element into a new parameter X°'{t). By 
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the regular representation of the time translation group, this leads to a new 
operator A"'' given by 

iHf 

A^-'V{V) = A'^y{\'^[t)) = cxp( — )A"y(A'^). (30) 

Vectors u{t) in the space A'^-^K."' correspond to vectors ^^u{t) in /C". In 
particular, then, during the time span t, we have that vq develops into 

?■ Hf 

Wt = A- -^eM-—Wwo. 

Finally, if H and the expectation operator A" commute, this gives 

, iHt. 

Wt = exp( ;— )wo, 

n 

and by transforming back from /C" to Ti by another assumption of commutation, 
the state vector at time t will be 

, iHt. 
vt = exp(--^)wo. 

As is well known, the latter equation is just a formulation of the familiar 
Schrodinger equation 

iri^vt = Hvt. 
at 



21 Paradoxes and some further themes. 

Here I include a very brief discussion of some familiar themes from quantum 
mechanics, many of which are discussed in several textbooks. A very recent 
discussion of several points is given in Laloe [13 ■ 

Of course, much more can be said on each theme. Some of the statements 
below are controversial, and many are certainly too simplified. The brief state- 
ments may serve as a starting point of a discussion, however. Our main concern 
is to point out similarities between (our version of) quantum physics and sta- 
tistical modelling. 

The status of the state vector. 

We concentrate on a discrete parameter, typically multidimensional: Sup- 
pose that A° (0) is maximal in the sense that no parameter can be connected to 
any experiment in such a way that A" is a function of this parameter. Then the 
operator corresponding to A°(-) has a non-degenerate spectrum. Thus each 
specification (A"(^) = A^) is equivalent to specifying a single vector w^. We em- 
phasize that A° is a parameter which is specifically connected to the experiment 
(or question) a E A. 

Thus in this case the state can be specified in two equivalent ways. For 
a non-physicist the specification (A"(0) = A^), that is, specifying all quantum 
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numbers, is definitively simpler to understand than the Hilbert vector specifica- 
tion. It is easy to see that every Hilbert space vector is the eigenvector of some 
operator. Assuming that this operator can be chosen to correspond to some 
A", it then follows that the state vector can be written as equivalent to some 
(A'*(0) = A^). A more general statement will include continuous parameters. 

A limitation of this, however, is for a state evolving through the Schrodinger 
equation. While it might be true that vt at each t is equivalent to some state- 
ment {X°'{t) = A^(f)), the parameter X°'{t) will in case necessarily change with 
time. 

Note also, of course, that in the formulae of Section 17 and in related results, 
the state vector is needed explicitly. 

Collapse of the wave packet. 

If we maintain that the role of the wavefunction is to give condensed infor- 
mation about what is known about one or several parameters of the system, 
then it is not strange that the wavefunction changes at the moment when new 
such information is obtained. This collapse due to change of information is well 
known in statistics. 

Superselection rules. 

When parameters are absolutely conserved, for instance charge or mass, 
then also in conventional quantum mechanics no linear combination is allowed 
between the vectors specifying the different corresponding states. This may to 
some extent serve to emphasize our view that a wave function makes sense only 
if it can be made equivalent to some statement X"'{(f)) = A^. 

Wigner's friend etc. 

In principle a statistical model can be formulated for a given system either 
excluding a certain observer (measuring apparatus) in the model, or including 
this observer. There is no contradiction between these two points of view in 
principle. 

Bohr complemetarity. 

A limited experimental basis implies that an experimentalist must choose 
between measuring/ specifying the maximal parameter A^ or the maximal pa- 
rameter A^. It is impossible to specify both. And knowledge of both parameters 
is impossible to have. As has been stated earlier, several macroscopic examples 
of the same phenomenon can be found. 

Schrodinger's cat. 

The mctaparameter can again be imagined to give a complete description 
of the whole system, including the death status of the cat. What can be observed 
in practice, are several complementary parameters A", many of which include 
information of the death status, but some which don't. Included among the 
latter is the state variable developed by registrating the initial state of the 
radioactive source, and then letting some time go. 

Decoherence. 

When a system in a state X'^{(f>) = AJJ enters into an interaction with an 
environment with a large degree of freedom, a state involving a probability 
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distribution over different A^-values will soon emerge. 
Histories. 

Choosing different focus parameters or experiments at each of a sequence of 
time points, we get a history of the kind 

Ai(0) = Ai, A2(0) = a2, A3(0) = Ai.... 

The resulting sequence of quantummechanical states has been discussed by Grif- 
fiths dHI and others. 
Many worlds (Everett (HI). 

There is a need for many models, not many wolds. Each time a choice of a 
measurement is made, a new model is needed. All future sequences of potential 
choices will make a need for many new models. (This is definitely a point which 
needs further specification and clarification.) 

Quantum mechanics and relativity. 

Relativistic quantum mechanics is beyond the scope of the present paper. 
However, it is well known that the use of symmetries, in particular representa- 
tion theory for groups is much used in relativistic quantum mechanics and in 
elementary particle physics. Hence a development of the theory in that direction 
may appear to be possible, and would certainly be of interest. 

It has often been said that it is difficult to reconcile quantum mechanics 
with general relativity theory. While this at the moment is mere speculations, 
it might be that the explanation is just that the transformation groups in general 
relativity are so large that no representation theory exists. (The groups are not 
locally compact.) Thus the formal apparatus of quantum mechanics has no 
place. However, it might still be that the present approach base on models, 
symmetry, focus parameters and model reduction may prove to be useful. 

22 Concluding remarks. 

The two most important arguments for the approach of the present paper, are 
as follows: 

(1) Instead of taking formal, abstract axioms as the point of departure, we 
develop the theory using at each point comparatively reasonable assumptions. 

(2) In principle the theory is an extension of current statistical theory under 
symmetry assumptions. Hence the concepts involved can be related to concepts 
that have proved useful also in other areas of science. 

Having said this, it must also be said that there will be aspects of the theory 
as formulated in this paper, that will need to be developed further. 

Note that the framework of the theory discussed here is very general. The 
metaparameter can be almost anything, and contain any set of different pa- 
rameters 9°'. 

Up to now we have only been talking about the relationship of this theory to 
current physical theory. Another interesting theme is the relationship between 
the theory and current statistical theory. For instance, traditionally statisticians 
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have - somewhat simphfied - been divided into Bayesians, who always use prior 
probabihty measures on the parameter space and frequentists, who never do 
that. From the theme of the present paper it should be possible to clarify 
better when (under symmetry assumptions) measures on the parameter space 
can be useful. Also, the themes of complementarity and of model reduction 
should be developed further in a statistical setting, and the interrelation between 
experimental design and inference which is implicit in this paper, should be 
further explored. 

Finally, traditional statistical theory usually regards the parameter space 
just as a set. The present work indicates that it may be useful to have some 
structure on this set. This conclusion is also supported by applications, and it 
is in line with the recent work by McCuUagh |13|. 
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